Convolutional Neural Networks (CNNs)

A convolutional neural network (CNN, or ConvNet) is a class of deep neural networks, most commonly applied to analyzing visual imagery. They are also known as shift invariant or space invariant artificial neural networks, based on their shared-weights architecture and translation invariance characteristics

Resources

Convolutions

1x1 convolutions

Human pose estimation and activity recognition

Code

Channel/visual attention

References

Sequence (time series) modelling

Object classification, image recognition

See AI/Computer Vision/Object classification, image recognition

Semantic segmentation

See AI/Computer Vision/Semantic segmentation

Object detection

See AI/Computer Vision/Object detection

Video segmentation and prediction

See AI/Computer Vision/Video segmentation and prediction

Image and video captioning

See AI/Computer Vision/Image and video captioning

Image-to-image translation

See AI/Computer Vision/Image-to-image translation

Super-resolution

See AI/Computer Vision/Super-resolution#Supervised CNN-based

Inpainting

See AI/Computer Vision/Inpainting and restoration#CNN-based

Background subtraction, foreground detection

See AI/Computer Vision/Background subtraction#CNN based

Edge detection

Human pose estimation and activity recognition

Motion detection, tracking

Deconvolution

Visual/Channel attention and Saliency

See AI/XAI#Explainability methods for Neural Networks

Spherical CNNs

See AI/Deep Learning/Spherical CNNs